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ABSTRACT 

Included in this_ summary analysis of three journal 
articles and two documents on teacher evaluation are reports that 
touch on difficulties facing school officials in developing a teacher 
evaluation system and suggestions for assessing the strengths and 
weaknesses of an evaluation system. The first article, by Susan S. 
Stodolsky, challenges evaluation methods that rely on classroom 
observation alone and emphasizes -the importance on the entire 
instructional context. The second, a document written by Rand 
Corporation researchers, concludes from a study of four exemplary 
school districts that the teacher evaluation process is inseparable 
from the school district's larger organizational context. The three 
remaining selections focus on practical aspects of implementing a 
teacher evaluation system. In his guide, Thomas L. McGreal identifies 
nine "commonalities" of effective systems that support his conviction 
that evaluation should help, rather than judge, teachers. Taking the 
position that some form of summative evaluation is nonetheless needed 
for making personnel decisions, James Raths and Hallie Preskill offer 
recommendations to guide administrators through the task of teaching 
staff assessment. Because of the potential legal implications brought 
with any personnel decision, the final selection by Donovan Peterson 
provides, guidelines for ensuring that a teacher evaluation system 
^ will withstand judicial scrutiny. (CJH) 
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Thomas I. Ellis 

T< he current educational 
reform movement has 
thrown a spotlight on 
teacher competency, leading 
policy makers to pursue a variety 
of measures ranging from merit 
pay and career ladders to tenure 
review and mandatory dismissal 
of those teachers who are demon- 
strably incompetent. All such prop- 
osals inevitably iinderscpre the 
need for an effective, reliable, and 
legally defensible system of teacher 
evaluation. 

Teacher evaluation need not be 
what it too often becomes; an essen- 
tially meaningless formality re- 
S^de^ with suspicion arid even 
contempt by teachers arid as frus- 
trating by supervisor?. Moreover, it 
does not have to be a source of 
contention between teachers and 
administrators. If a teach er evalua- 
tion system is research-based, 
designed to improve instruction, 
and approached with a cooperative 
attitude hy ail parties, it^can be an 
effective and dynamic agent for 
educational renewal. 

In developing a teacher evalua- 
tion system, school officials must 
confront two fundamental difficul- 
ties. The flret is that no completely 
objective approach^ to assessing^ 
teacher performance has ever been 
found (though in recent years 
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researchers have been making 
gains on Jtriis problem). . Indeed, it 
has become ;ixiomatic_that the_ 
more explicit or standardized the 
criteria become, the less valid they 
are for assessing a particular 
teacher s effectiveness. 
_Jhe other problem is that the two 
major purposes^ of teacher evalua- 
tion— helping teachers Jo become 
better at their job and providing a 
basis for making personnel deci- 
sioris^are usually perceived as 
being at odds with each other. 
?Ife c A ive :f?™2tive evaluation re : 
quires a relationship marked by 
mutual trust between teachers and 
supervisors. Yet such a relationship 
is hardly possible if the teacher 
feels that the evaluation process 
may lead to an adverse personnel 
decision or if the supervisor fears 
that such a decision may lead to a 
court suit: 

Five recent reports that touch on 
these difficulties provide useful 
guidelines for assessing the _ 
strengths arid weakriesses of just 
about any school's teacher evalua- 
tion system: The first, by Susan S. 
Stcnioisky. challenges the validity of 
evaluation methods that rely on 
classroom observation alone and 
emphasizes the importance 6f_ 
looking at the whole context of 
instruct matter arid 

activity structure as they relate to 



teaching style) when evaluating 
teacher performance. 

Iri the second, several Rand Cor- 
poration researchers concluded 
f rnm j hMr sti i dy o f fni i f p xp rh pi a ry 
school districts that the teacher 
evaluation process is inseparable 
from the larger organizational 
context of the school district. Find- 
ings from the study suggest that 
organizational commitment and 
? ta *I Jnyofreroent are essential for 
effective teacher eyaluat ion , regard- 
less of the methods tised, __ 

The remaining three selections 
focus on practical aspects of imple- 
meritirig a teacher evaluation sys- 
tem. Iri his cbmpreherisive guide 
based on extensive experience, 
ThomasJ.. McGreal identifies riirie 
"commonalities" of effective sys- 
tems, all in keeping with his convic- 
tion that the primary purpose of 
evaluation should be to help, rather 
thari to judge, teachers. 

Turn ing to the proposition that 
some form of summative evaluation 
is_ nonetheless needed for making 
personnel decisions, dames Raths 
arid Hallie Preskill offer some re- 
commendations to guide adminis- 
trators through the delicate task i f 
deciding which members of the 
teaching staff are excellent, satis- 
factory, or incompetent: 
_ Because any personnel decision 
based oh teacher evaluation is 
fraught with potential legal trou- 
bles, the final selection by Peterson 
offers practical guidelines for en- 
suring that a teacher evaluation 
system will withstand judicial 
scrutiny. 
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1 Stodolsky, Susan S. 
teacher Evaluation: 
The Limits: of Look- 
ing. " Educa iiohal Researcher 
13; 9 (November 1984), 11- 
18: EJ 309 391: 

Most teacher evaluations rest on 
a few direct observations, usually 
unspecified as to situation. The 
%^J? ^^y^P^^J 5 Jtl«t_effectiye 
teaching can be reduced to a set of 
generalized behaviors that are 
consistent across teaching situa- 
tions and occasions. 

Stodolsky challenges this as- 
sumption. Wielding both concep- 
tual arguments and empirical data, 
she shows that the effectiveness of 
any teaching style or behavior 
depends on the context in which 
the teaching takes place. Hence, 
evaluations based on a small 
number of classroom observations 
canno^ do justice to the range of 
teaching behaviors, skills, and 
arrangements used for different 
purposes by teachers, particularly 
at the elementary level. 

Stodolsky proposes a flexible 
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"act ivj ty stmcture" approach , t led 
to subject matter and curriculum^, 
as a more accurate characterization 
of elementary school teaching, the 
evaluator assises how appropriate 
the teaching style arid approach are 
to the subject matter at hand and 
to the skill level of the students. To 
do this, the evaluator must describe 
how activities are structured, who 
is present, how long the activity 
lasts, arid what its instructional 
purpose or format is. 

The unit of analysis in this 
method is riot the usual random 
selection of different classes, but 
rather what Stodolsky labels "activ- 
ity segments." Each activity seg- 
ment comprises a particular set of 
instructional formats, particip- 
ants, materials, behavioral expecta- 
tions arid goals, arid time bound- 
aries. The evaluator takes all of 
these variables into account in 
determining the appropriateness 
or effectiveness of any given in- 
structional strategy. Such an ap- 
proach does justice, it is pointed 
out, to the nature of teaching as a 
context-bound activity. 

A study of fifth-grade math and 
social studies classes in Chicago 
tested Stodolskys proposed ap- 
proach. The finding-indicate that 
one should expect systematic vari- 
ation in teaching and instructional 
arrangements, riot consistency. 
The same teachers teaching the 
same children in the same physical 
sj^AMllLV*? very different instruc- 
tional arrangements as they 
switched from math to social 
studies. Mathematics instruction 
proved to be homogeneous within 
arid across classrooms, whereas 
social studies instruction charac- 
teristically tends to be diverse. 
Stodolsky proceeds to describe in 
detail three key features of the 
activity segments studied: instruc- 
tional format, pacing, and cognitive 
level. 

The data indicate that elementary 
school teachers are esseiitially 
generalises who create aJ)road_ 
repertoire of organizational and 
pedagogical arrangements. Thus 
assumptions of Internal consis- 
tency made in cbririectibn with 
*^ch^?^uation procedures 
must^sertously be questioned. 
Stodolsky therefore concludes that 



evalUatdrs should exarrnrie teach 
ing within an overall cdniext=tak - 
J Jig i nt q accou n t the su bjec t rriia 1 1 e r 
and the structure and purpose qj 
classroom activities — rather than 
simply identifying the presence or 
abserice pf a list of teaching "be- 
haviors "judged to be desirable. She 
concedes, however, that such a 
compiex and extensive task for the 
study of teachers and teach ing wili 
not win easy converts: 




Wise, Arthur E. , and 
others. Teacher 
Evaluation: A Study 
of Effective Practices. Santa 
Monica, California: Rand 
Corporation. Sponsored by 
the National Institute of 
Education, Washington; 
DC, June 1984. 101 pages. 
ED 246 559. 

Teacher evaluation does noi 
occur in a vacuum. Wise and his 
colleagues note; it is shaped by the 
organizational, political, and in- 
structional context in which it 
takes place. So how do these contex- 
tual factors influence the quality of 
teacher evaluation programs? To 
find out, the Rand Corporation 
conducted case studies in four 
school districts with highly effec- 
tive, but very different, teacher 
evaluation programs: Salt Lake 
City. Utah: Lake Washington, 
Washington: Greenwich, Connec- 
ticut: and Toledo. Ohio. Despite 
their varying approaches, certain 
common characteristics set these 
systems apart from less successful 
ones. 

The first common factor was 
organizational commitment — the 
willingness of the districts' top-level 
leadership to devote adequate time, 
personnel, and institutional re- 
sources to teacher evaluation. The 
second common factor was the 
recognized competence of the 
evaluators in making judgments 
and recommendations, coupled 
with mechanisms for cross-verifica- 
tion of their accuracy. 

Third, in all these districts the 
teachers and administrators col- 
laborated to develop a common 
understanding of the processes to 
be used in the evaluation arid the 
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goals involved: Fourth; the evalua- 
tion process and support systems 
were compatible with each. other 
and with the district s overall goals 
and organ izatidnal cont ext. 

Perhaps the most significant 
finding to emerge from this study 
is t h<* importance of teacher part ici- 
pation in developing and carrying 
otit the evaluation process. The 
authors recommend that school 
districts involve expert teachers in 
peer support arid assistance, that 
they involve teacher organizations 
in thedesign and oversight phases, 
and that they adopt a professional 
(as opposed to bureaucratic) ap- 
proach to evaluation. Teachers 
should be held accountable riot to 
arbitrary arid hierarchically en-_ 
forced criteria, they note: but rather 
to constantly evolving standards of 
practice developed by consensus 
among the teachers themselves. 

McGfeal, Thomas L. 

Successful Teacher 
Evaluation. Alexan- 
dria, Virginia: Association 
For Supervision anc^Cur- 
rieulum Development, 
1983. 175 pages. ED 236 
776. 

Thomas McGreal has worked 
with some 300 school districts and 
75.000 supervisors and teachers 
around the country to build local 
teacher evaluation systems. In 
keeping with his conviction that 
teacher evaluation should be tail- 
ored to the specific needs of local 
districts. McGreal does not attempt 
to "sell" one approach or another. 
Rather, he organizes his book _ 
around eight "commonalities" of 
effective practices. 

The first three commonalities 
constitute the framework for build- 
in^g a teacher evaluation system: an 
appropriate atti tude, an evaluation 
model attuned to the desired pur- 
pose, and the separation of ad : 
mihistrative and supervisory be- 
havior. Evaluation. McGreal in- 
sists, should ^ive^iiniarily to help 
teachers^ become more effective 
rather than to obtain documenta- 
tion for personnel decisions. Sys- 
tems that emph^izeaccbimtabilityr 
tend to promote hostility between 
teachers and administrators, he 



says, whereas systems geared to 
ir.iprbvjrig instruct ion will cultivate 
riilitual trust while still providing 
adequate information for assessing 
minimum^ competence when such 
assessment becomes necessary. 

The next foUr cdmmdnaiities are 
grouped as "focusing activities." 
The first is goal-setting as a 
cooperative activity between 
teacherand supervisor. Evaluation 
criteria that result from cooperative 
goal setting are more likely to fit the 
teacher s needs and methods, 
McGreal notes, whereas the alterna- 
tive— -imposition of criteria by the 
supervisor — forces all teachers to 
conform to an artificial standard. 
Cooperation also fosters mutual 
trust iri the supervisory relatidri : 



Second. McGreal emphasizes the 
need for a common framework and 
a similar set of definitions about 
teaching from which to work. 
Third, to accommodate time cdh : 
straints. he suggests two ways to 
increase the reliability of classroom 
observations; (1) narrowing the 
range of things to look for in accord- 
ance with previously established 
goals, arid (2 j_ increasing the 
amount of information assembled 
by the eyaluatdr prior to the obser- 
vation. McGreal also advocates the 
ustof^t^triatiye^ourceso[6AtB. to 
supplement classroom observa- 
tions. These can include self-evalu- 
ation, peer evaluation, parent 
evaluation, student evaluation, 
student periormance, and collec- 
tion of such instructionai artifacts 
asstudy guides, homework assign- 
ments, and tests. All these ap- 
proaches, he notes, require a close 
working relationship between 
teachers arid supervisors. 

The final section Jqcuses on 
training the staff and starting the 
system. As McGreal points out. the 
effectiveness of any evaluation 
system depends on the amount of 
training received by the particip- 
ants-^includingtK)th teachers and 

^yp? 17 ?*????!^ 11 !^^ skills and_ 
knowledge necessary to implement 
the system arid effectively maintain 
it. Ari appropriate training pro- 
gram, he says, should include 
goal-setting skills for both super- 
visors and teachers^ theoretical 
and practical training in the 



selected teaching focus. explana : 
t idri arid practice in the use of 
student descriptive jiata_ and [ ar- 
tifact_coiiection; and classroom 
observation and conference skills 
tor supervisors. 

4 Raths, James, and 
Hallie Presiull. 
"Research Synthesis 
on Surrirriative Evaluation 
of Teaching." Educational 
Leadership 39, 4 (January 
1982). 310-13, EJ 257910. 

Most educators agree that the 
primary aim of teacher evaluation 
should be to improve instruction. 
Nevertheless, school districts still 
need a reliable basis oii which ^o 
make decisions on tenure, promo- 
tion; reassignment or dismissal. 
For this reason, teacher evaluation 
systems heed hot only to help 
teachers improve, but also provide 
a basis forjudging their perfor- 
mance. Raths i and Preskill provide 
some clear thinking on this sum- 
mative (as opposed to formative) 
dimension of teacher evaluation. 

Judgment of teaching peHbr : 
mariee is inescapably a subjective 
process, the authors believe. ^ince 
objective or measurable standards 
do nqtexist for the various aspects 
of teaching performance. Accord- 
ingly, some evaluators tend to 
emphasize tangible patterns, such 
as instructional techniques, 
teacher behavior, and lessoirplan- 
ning. Others emphasize jsuch in- 
tangible qualities as cooperation, 
professionalism, and other general 
character traits. Whatever stan- 
dards theystress, the authors say, 
evaluators must be able to com- 
mUnicate clearly the rationale 

teachers, school boards, or courts. 

Raths and Preskill describe two 
general approaches to summative 
evaluation. The "arithmetic*' af*- 
prdach involves assigningxjuan- 
titative values to relevant dimen- 
sions of instruction, weighting 
them for importance, rating 
teachers oh each dimension, arid 
then tallying the score. This proce- 
dure is popular, they ricte. because 
it confers the appearance i of objec- 
tivity or; an essentially subjective 
process. 
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holistic approach i involves 
summary assessments made by 
one or more, evaluators and based 
bh all available ihTdrmatjon. De- 
spite the seeming objectivity of 
quantitative values, the authors 
say studies have shown that holis- 
tic judgments about teaching have 
more predictive validity than do 
arithmetic approaches. 

Three methods are described for 
making holistic judgments: (1) a 
balloting system based on paired 
^P^P^^fPi? 5 ! 1 ! ^K^te^hers are 
ranked from most^effectlve to least: 
(2) a relative rating of teachers on 
a 1 to 5 bash, with 3 denoting the 
norm; arid (3) the application of a 
similar five-point relative scale to a 
breakdown of separate teaching 
skills and behaviors. 

The authors conclude that the 
inevitably subjective and impress- 
ionistic nature of summaiive evalu- 
ation can best be controlled by 
"triangUlatiph'' piwedures, 
whereby judgments are made by 
several ^valuators— includ ing both 
supervisors and peers— and then 
cross-referenced. 

Peterson, Donovan. 
"Legal and Ethical 
Issues of Teacher 
Evaluation: A Research- 
Based Approach , r Educa- 
tional Research Quarterly 
7. 4 (Winter 1983), 6-16. 
EJ 284 820. 

Use of teacher evaluation for 




personnel decisions necessarily 
has legal ramifications. In numer- 
ous cqurt cases, teachers ti*ive_ 
challenged dismissals on grounds 
that administrators used unsound 
criteria or procedures to obtain 
evidence of incompetence. 

Peterson s guide covers the legal 
Issues that pertain to the design 
and plausibility of teacher evalua- 
tion systems. He offers helpful 
guidelines on such tssues as due 
process, discrimination, validity, 
reliability, high arid low inference 
variables, representative observa- 
tion of teacher behavior, research- 
based variablesas opposed tojiiose 
that^ are cons ?nsus-based._and 
number and length of observations. 

The test rji validity, for example, 
assesses whether teacher evalua- 
tion criteria are job-related arid 
whether evaluation systems actu- 
al? j? 16 ]*^ 1 ^^ attributes they 
claim to measure^Reliabillty refers 
to the consistency of judgments 
over time among separate bbser-_ 
vers- A variable is rated as being of 
high or low inference, depending 
"tin the degree of personal judgment 
an observer must apply to deter^ 
mine the presence or absence of the 
attribute for which the teacher is 
being rated, arid its quality. High 
inference variables tend to lack 
reliability, Peterson says, whereas 
low inference_vaurtabl^^^ 
and measurable J^hA^rs^^ricl 
to lack validity in distinguishing 
between effective and ineffective 
teaching. 

Peterson suggests that schools 



adopta three-level evaluation sys- 
tem. The first level, applying to the 
majority of experienced teachers, 
would consist of formative evalua- 
tion alone— |ot example, peer and 
self-evaluations, goai-setting. and 
an annual conference with a super- 
visor. 

The second level would be for i lew 
teachers arid those identified as 
perhaps needing improvement. 
The emphasis is still on tlie forma- 
tive but involves several observa- 
tions by trained observers to iden- 
tify problems arid develop remedia- 
tion strategies. 

The third level, sum illative evalu- 
ation , wou-d be used only tin 
trachersjudgedtobe incompetent; 
Here documentation becomes im- 
portant, along with judgments by 
more than brie person arid ample 
provision for remediation anc 
assistance. By restricting sUmrill 
tive equation to 'borderline" 
cases, schools simultaneously 
reduce the potential for legal chal 
lenges and concentrate their re- 
sources Tor adequate documenta- 
tion in the event of such challenges 
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